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PROGRAMMABLE LOGIC DEVICE 

Cross-Reference to Related Applications 

The present invention may relate to co-pending 
application Attorney Docket No. 0325.00293, filed concurrently, 
which is hereby incorporated by reference in its entirety. 

Field of the Invention 

The present invention relates to programmable logic 
devices generally and, more particularly, to a programmable logic 
device architecture that may combine the high-speed, predictable 
timing, and ease of use of a complex programmable logic device 
(CPLD) with the high density and low power of a field programmable 
gate array (FPGA) . 

Background of the Invention 

Traditionally there are two types of programmable logic 
architectures: complex programmable logic device (CPLDs) and field 
programmable gate arrays (FPGAs) . The CPLD can be constructed as 
a one-dimensional array of logic blocks made of 16 macrocells and 
a product term array connected through a single central 
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interconnect scheme . The CPLD achieves high performance by being 
able to complete a complex logic function in a single pass of the 
logic array and has predictable timing by having every output or 
I/O pin connected to every logic block input through the central 
interconnect structure. The CPLD can be non-volatile by using an 
EEPROM process . 

However, the architecture of the conventional CPLD has 
disadvantages. A complex process technology limits performance and 
increases cost. A high standby power limits capacity and 
applications. The conventional CPLD has no available on-chip RAM. 
The maximum capacity of the conventional CPLD is limited by 
interconnect structure performance, power, technology and die cost. 
The core voltages, I/O voltages, and I/O standards of the 
conventional CPLD are not flexible. The I/O cells need a 
synchronous output enable (OE) to support a synchronous circuit 
architecture with minimal bus latency (e.g., as found in NoBL™ 
SRAMs manufactured by Cypress Semiconductor Corp. or ZBT™ devices 
manufactured by Integrated Device Technology) memory. 

An FPGA architecture is constructed from a two 
dimensional array of logic blocks called CLBs . The CLBs are made 
from 4-input look-up- tables (LUTs) and flip-flops. The LUTs can be 



0325.00292 
CD99047 

used as distributed memory blocks. The CLBs are connected by a 
segmented interconnect structure. The FPGA architecture supports 
a low standby power and the LUTs can use a simple logic CMOS 
process. Since the two-dimensional array of CLBs and the segmented 
interconnect structure are scalable, the FPGA can achieve high 
densities . 

However, the architecture of the FPGA has disadvantages. 
A volatile process requires a FLASH/EEPROM to be added to the 
design. The segmented routing architecture limits performance and 
makes timing unpredictable. Implementing a dual port or FIFO 
memory with LUTs is slow and inefficient. A complex "design-in- 
process" is required because the conventional FPGAs do not have 
predictable timing, short compile times, in-system- 
reprogrammability (e.g., ISR®, a registered Trademark of Cypress 
Semiconductor Corp.) or pin fixing. The core voltage of the 
conventional FPGA is (i) not flexible and (ii) driven by the 
current process. The conventional FPGA makes product migration 
very difficult and does not support full JTAG boundary scan and 
configuration . 
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Summary of the Invention 

The present invention concerns a programmable logic 
device comprising one or more horizontal routing channels, one or 
more vertical routing channels, and one or more logic elements. 
Each of the logic elements may be configured to connect between one 
of the horizontal routing channels and one of the vertical routing 
channels. The logic elements may comprise a logic block array and 
a memory block. 

The objects, features and advantages of the present 
invention include providing a programmable logic device 
architecture that may (i) implement advanced process technology 
without an EEPROM and/or high voltage; (ii) have a separate off 
chip non-volatile storage device within the same package; (iii) 
reduce power consumption; (iv) provide configurable single port 
RAMs, dual port RAMs and/or FIFOs; (v) provide configurable I/O 
cells; (vi) bank I/O cells to support multiple I/O standards within 
the same product; (vii) use a hierarchical PIM based routing 
structure for a high-speed, compact routing structure; (viii) have 
a simple timing model; (ix) provide a dual port memory with 
dedicated logic and arbitration or a FIFO memory with dedicated 
logic and flags, that may improve memory performance and increase 
the capacity of the device; (x) have a routing structure that may 
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not require a place and/or route step; (xi) provide a STAPL 
interface to allow for a part to be reconfigured in-system; (xii) 
provide an on-chip voltage regulator for a flexible core voltage; 
and/or (xiii) support a JTAG boundary scan (including INTEST) for 
5 easy debugging. 

Brief Description of the Drawings 

These and other objects, features and advantages of the 
*3 present invention will be apparent from the following detailed 
l'GJ description and the appended claims and drawings in which: 

is 5 

^ FIG. 1 is a block diagram of a preferred embodiment of 

the present invention; 

\& 

fjj FIG. 2 is a block diagram of a logic element of FIG. 1; 

u 

C3 FIG. 3 is a detailed block diagram of a logic block array 

1# of FIG. 2; 

FIG. 4 is a detailed block diagram of a logic block of 

FIG. 3; 

FIG. 5 is a schematic diagram of a product term 
allocation block of FIG. 4; 
20 FIG. 6 is a schematic diagram of a macrocell circuit of 

FIG. 4; 

5 
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FIG. 7 is a detailed block diagram of a configurable 
memory block of FIG. 3; 

FIG. 8 is a detailed block diagram of a configurable 
memory block of FIG. 2; 

FIG. 9 is a detailed block diagram of a configurable 
memory of FIG. 8 configured as a FIFO; 

FIG. 10 is a detailed block diagram of a configurable 
memory of FIG. 8 configured as a dual -port memory; 

FIG. 11 is a detailed block diagram of a clock PLL 
circuit of FIG. 1; 

FIG. 12 is a detailed block diagram of an I/O block of 

FIG. 1; 

FIG. 13 is a detailed schematic of an I/O cell of FIG. 

12; and 

FIG. 14 is a block diagram illustrating I/O blocks of 
FIG. 1 combined to form I/O banks. 



Detailed Description of the Preferred Embodiments 

Referring to FIG. 1, a block diagram of a programmable 
logic device (PLD) 100 is shown in accordance with a preferred 
embodiment of the present invention. The PLD 100 may be 
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implemented using an advanced CMOS logic process. The PLD 100 may- 
be implemented as a volatile or non-volatile device. When a non- 
volatile device is desired, the PLD 100 may be implemented with a 
separate non-volatile storage device within the same package. The 
PLD 100 may support, in one example, an external core voltage of 
3.3V, 2.5V, or 1.8V through the use of an internal voltage 
regulator. The range of core voltages may provide for the 
migration of previous applications to the PLD 100 without requiring 
a change in a power supply scheme . 

The PLD 100 may comprise, in one example, a number of 
logic elements 102a- 102n, a number of horizontal routing channels 
104a-104n, a number of vertical routing channels 106a-106n, a 
number of I/O blocks 108a-108n, a PLL/CLK multiplexer circuit 110, 
and a control circuit 112. The logic elements 102a-102n may be 
placed in a two dimensional array (e.g., 2x3, 3x3, 3x4, 4x4, 4x5, 
5x5, 5x6, 6x6, etc.) . The array sizes may be adjusted to meet the 
design criteria of a particular application. Each of the logic 
elements 102a-102n is generally connected to (i) one of the 
horizontal routing channels 104a-104n in the x axis and (ii) one of 
the vertical routing channels 106a-106n in the y axis. The routing 
channels 104a- 104n and 106a- 106n generally form a row and column, 
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non-segmented routing matrix. The logic elements 102a- 102n may be 
configured to receive a global clock signal (e.g., GCLK) and a 
control signal (e.g., C a . n ) . The signal GCLK may comprise one or 
more clock signals. In one example, the signal GCLK comprises four 
clock signals (e.g., GCLK[3:0]). The signal C a . n may be n-bits 
wide, where n is an integer, and used to control the configuration 
of the PLD 100. Each bit of the signal C a . n may use fewer than n 
bits. 

Each of the I/O blocks 108a-108n may be connected to an 
end of the routing channels 104a-104n and 106a-106n. The I/O 
blocks 108a-108n may have (i) an input that may receive a global 
control signal (e.g., GCNTRL) , and (ii) an input the may receive 
the signal GCLK [3:0]. The global control signal GCNTRL may 
comprise one or more control signals. In one example, the signal 
GCNTRL may comprise four control signals (e.g., GCNTRL [3:0]) . The 
separate I/O blocks 108a- 108n may provide for improved design 
"fitting", pinout flexibility, and I/O performance. The I/O blocks 
108a- 108n may be combined to form I/O banks that may be configured 
to support all of the current I/O standards within the 1.5V to 3.3V 
range . 



8 
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The PLD 100 may have, in one example, four dedicated 
inputs configured to receive the four global I/O control signals 
GCNTRL[3:0] , four dedicated clock inputs configured to receive four 
input clock signals (e.g., INCLK[3:0]), an input configured to 
receive a test clock signal (e.g., TCK) , an input configured to 
receive a command signal (e.g., INTEST) , and an input configured to 
receive an enable signal (e.g., GINTSCEN) . However, other numbers 
of dedicated pins may be implemented accordingly to meet the design 
criteria of a particular application. The signals GCNTRL[3:0] are 
generally presented to control inputs of the I/O blocks 108a-108n. 
The signals GCNTRL[3:0] may be used, in one example, for fast I/O 
control (e.g., reset, output enable and/or clock enable). 

The signals INCLK[3:0], TCK, INTEST, and GINTSCEN are 
generally presented to the PLL block 110. The PLL block 110 may be 
configured to generate the four global clock signals GCLK[3:0] in 
response to the signals INCLK[3:0] and C a . n . The global clock 
signals GCLK[3:0] may be presented to a clock input of every 
register in the PLD 100. The PLL block 110 may be configured to 
present any or all of the signals GCLK[3:0] as a multiplied, 
divided, phase shifted, or de-skewed version of the signal 
INCLK[0] . 
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The control block 112 may configure the PLD 100 to 
provide (i) support for a JTAG boundary scan, the JTAG programming 
standard STAPL, JTAG INTEST, and/or full scan and (ii) support for 
several configurations that may use compression/de-compression to 
reduce storage requirements and error checking to detect problems. 
The configurations may include a master serial, a master parallel, 
and/or a JTAG interface. The control block 112 may be configured 
to generate the control signal C a _ n in response to an external 
configuration signal (e.g., CONFIG) . The signal C a _ n may be used to 
configure signal pathways and components within the PLD 100. 

Referring to FIG. 2, a block diagram of the logic element 
102 is shown. The logic element 102 may comprise, in one example, 
a logic block array 114 and a configurable memory 116. The logic 
block array 114 may have (i) an input 117 that may receive the 
signal C a _ n (ii) an input /output 118 that may receive/present 
signals from/to the horizontal routing channel 104, (iii) an input 
119 that may receive the signal GCLK[3:0] , and (iv) an input/output 
12 0 that may receive/present signals from/ to the vertical routing 
channel 106. The configurable memory 116 may have (i) an input 121 
that may receive the signal C a . n , (ii) an input /output 122 that may 
receive/present signals from/to the horizontal routing channel 104, 
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(iii) an input 123 that may receive the signal GCLK[3:0] and (iv) 
an input/output 124 that may receive/present signals from/to the 
vertical routing channel 106. The configurable memory 116 may be 
configured, in one example, as (i) an asynchronous dual port 
memory, (ii) a synchronous dual port memory, or (iii) a synchronous 
FIFO memory. The memory 116 may be configured as xl, x2 , x4, or x8 
wide memory. The memory 116 may be placed in the routing channels 
to achieve high performance with the I/O blocks 108. 

Referring to FIG. 3, a more detailed block diagram of the 
logic block array 114 is shown. The logic block array 114 may 
comprise, in one example, a programmable interconnect matrix (PIM) 
126, eight logic blocks 128a-128h, and two memory blocks 130a and 
13 0b. However, other numbers of logic blocks 12 8 and memory blocks 
13 0 may be implemented accordingly to meet the design criteria of 
a particular application. The PIM 126 is generally connected via 
the input/output 118 to the horizontal channel 104 and via the 
input/output 120 to the vertical channel 106. The input/outputs 
118 and 120 may comprise, in one example, 64 inputs and 72 outputs 
each. The greater number of outputs is generally used for the 
outputs of the memories 13 0a and 13 0b. The PIM 12 6 may have 3 6 
outputs that may present signals to the logic block 128a and 16 
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inputs that may receive signals from the logic block 128a. The PIM 
126 may have similar connections to the logic blocks 128b-128h. 
The PIM may have 25 outputs that may present signals to the memory 
13 0a and 8 inputs that may receive signals from the memory 13 0a. 
The PIM 12 6 may be similarly connected to the memory 13 0b. Since 
all the signals traverse the same path, the PIM 126 is generally 
fast and provides simple timing. The PIM 12 6 may provide redundant 
paths for pin-locking flexibility and maximum capacity. 

The signals GCLK[3:0] and C a _ n are generally presented to 
an input of the logic blocks 128a- 128h and the memories 130a and 
130b. The logic blocks 128a-128d may be connected by a carry 
signal (e.g., CARRY 1 ) to form a carry chain. The logic blocks 
128e-128h may be similarly linked by a carry signal (e.g. , CARRY 2 ) . 
Up to four logic blocks may be cascaded in a carry chain. The 
carry chain may start or stop at any macrocell within the carry 
chain. The carry chains may implement efficiently adders, 
comparators, and parity generators. 

The two memory blocks 130a and 130b may be implemented, 
in one example, as highly configurable RAM. The memory blocks 13 0a 
and 13 0b may be placed inside the logic block array 114 to achieve 
a high performance with the local logic blocks . The memory blocks 

12 
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130a and 13 0b may be configured as asynchronous, synchronous input, 
synchronous output, pipelined, xl, x2 , x4 , x8, look up tables, 
and/or ROM code. In one example, the memory blocks 13 0a and 13 0b 
may be implemented as 8K-bit memories for a total of 16K-bits of 
memory . 

Referring to FIG. 4, a more detailed block diagram of a 
logic block 128 is shown. The logic block 128 may comprise, in one 
example, a product term array 132, a product term matrix 134, and 
sixteen macrocells 136a-136p. To reduce power consumption, the AND 
plane and OR plane of the logic block 128 generally use complex 
CMOS logic gates instead of the conventional high power sense amps. 
The product term array 132 may be configured, in one example, to 
receive thirty-six input signals from the PIM 126. The product 
term array 132 may be configured, in one example, to generate (i) 
eighty product terms that may be presented to the product term 
matrix 134, (ii) a product term clock signal (e.g., PTCLK) that may 
be presented to each of the macrocells 136a-136p, and (iii) two 
product term signals (e.g., AP/AR0 and AP/AR1) that may be 
presented to the macrocells 136a-136p in response to the thirty-six 
input signals. The signals AP/AR0 and AP/AR1 may be used as reset 
or preset signals to any of the macrocells 136a-136p. 
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The product term matrix 134 may have, in one example, 
sixteen outputs 138a-138p that may present a signal to each of the 
sixteen macrocells 136a- 136p in response to one or more of the 
eighty input signals. Each of the outputs 138a-138p may be 
connected to an input 140a-140p, respectively, of the macrocells 
136a-136p. The signals GCLK[3:0] and C a _ n are generally presented 
to the macrocells 136a-136p. Each macrocell 136a-136p may have an 
output 142a- 142p that may present an output signal to an input of 
the PIM 126. A select signal (e.g., SELIN) may be presented to the 
macrocells 136a-136p. The macrocells 136a-136p may generate a 
select signal (e.g., SELOUT) that may be presented at an output of 
the logic cell 128. 

Referring to FIG. 5, a partial circuit diagram of the 
product term matrix 134 of FIG. 4 is shown. The product term 
matrix 134 may comprise, in one example, sixteen OR gates 144a- 
144p. The OR gates 144a-144p may be configured to logically 
combine up to sixteen product terms. However, other types of gates 
with other numbers of inputs may be implemented accordingly to meet 
the design criteria of a particular application. The product term 
matrix 134 may provide for product term sharing and steering that 
may be important for partitioning and/or fitting algorithms. 

14 
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Referring to FIG. 6, a circuit diagram of a macrocell 136 
is shown. The macrocell 136 may comprise, in one example, a gate 
146, a register 148, and a gate 150. The gate 146 may be 
implemented, in one example, as an XOR gate. The gate 150 may be 
5 implemented, in one example, as an AND gate. However, the gates 
146 and 150 may be implemented as other gates to meet the design 
criteria of a particular application. The gate 144 may present a 
signal to a first input of the gate 146 in response to up to 

i ^ 

'S sixteen product term signals. A signal selected from a first carry 

U J. 

1M product term (e.g., CPT0) , a second carry product term (e.g., 

-J- CPT1) , a portion of the control signal C a _ n , a signal presented at 

f a Q output of the register 148, and a signal presented at a Qb 

r 

\ l % output of the register 148 may be presented to a second input of 

S =5. 

3 the gate 146. The signal CPT0 or the signal CPT1 may be selected 

. 

- z 

15 in response to a select signal (e.g., SELIN) and the signal C a _ n . 
The selected signal (e.g., either CPT0 or CPT1) may be used as a 
select signal (e.g., SELOUT) . The gate 146 may have an output that 
may present a signal to an input of the register 148. The 
macrocell 13 6 may present the signal at the input of the register 

20 148 or the signal presented at the Q output of the register 148 as 
an output signal. The register 148 may be (i) clocked by the 

15 
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product term clock PTCLK, one of the four global clocks GCLK[3:0], 



or a digital complement thereof, (ii) reset by one of the product 



terms AP/AR[1:0], and (iii) preset by one of the product terms 



AP/AR[1:0]. Equations may be inverted before the register 148. 



5 The macrocell 136 may be used to create T and /T equations. A 



carry chain made up of macrocells may allow implementation of 



adders, comparators, and parity generators. 



Referring to FIG. 7, a circuit diagram illustrating the 
T: f memory 13 0 of FIG. 3 is shown. The memory 13 0 may have a data 



lfk input that may receive a data signal (e.g., DIN), an address input 

--J that may receive an address signal (e.g., ADDR) , an enable input 

%J 

s that may receive an enable signal (e.g., WE), a clock input that 
may receive the signal GCLK[3:0], a clock input that may receive a 

I J 
£2 

^ first local clock signal (e.g., LOGICLK0) , a clock input that may 

15 receive a second local clock signal (e.g., LOGICLK1) , a control 



input that may receive a reset signal (e.g., OUTREGCLR) , a 



configuration input that may receive the signal C a . n , and a data 



output that may present a data signal (e.g., DOUT) . The signals 



DIN and DOUT may be, in one example, 8 bits wide. The signal ADDR 



2 0 may be, in one example, 13 bits wide. 



16 



0325.00292 
CD99047 

The memory 130 generally comprises a memory array 152, a 
register 154, a multiplexer 156, a register 158, a multiplexer 160, 
a register 162, a write pulse generator 164, a multiplexer 166, a 
register 168, a multiplexer 170, a multiplexer 172, a multiplexer 
174, a multiplexer 176, a multiplexer 178, and a power down circuit 
180. The signal DIN is generally presented to an input of the 
register 154 and a first input of the multiplexer 156. The 
register 154 may have a clock input that may receive a clock signal 
(e.g., INPUT_CLK) and an output that may be connected to a second 
input of the multiplexer 156. The multiplexer 156 generally selects 
either (i) the signal DIN or (ii) the output of the register 154 as 
a data input signal to the memory array 152 in response to the 
signal C a . n . 

The signal ADDR is generally presented to an input of the 
register 158 and a first input of the multiplexer 160. The 
register 158 may have a clock input that may receive the clock 
signal INPUT_CLK and an output that may be connected to a second 
input of the multiplexer 160. The multiplexer 160 generally selects 
either (i) the signal ADDR or (ii) the output of the register 158 
as an address input signal to the memory array 152 in response to 
the signal C a _ n . 

17 
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The signal WE is generally presented to an input of the 
register 162 and a first input of the multiplexer 166. The 
register 162 may have a clock input that may receive the clock 
signal INPUT_CLK and an output that may be connected to an input of 
the write pointer 164. An output of the write pulse generator 164 
is generally connected to a second input of the multiplexer 166. 
The multiplexer 166 generally selects either (i) the signal WE or 
(ii) the output of the write pulse generator 164 as an enable input 
signal to the memory array 152 in response to the signal C a . n . 

The memory array 152 generally has an output that 
generally presents a data signal to an input of the register 168 
and a first input of the multiplexer 170. The register 168 may 
have a clock input that may receive a clock signal (e.g., OUTCLK) 
and an output that may be connected to a second input of the 
multiplexer 170. The multiplexer 170 generally selects either (i) 
the data signal from the memory array 152 or (ii) the output of the 
register 168 as the signal DOUT in response to the signal C a _ n . 

The multiplexers 172 and 174 may be configured to select 
one of the global clock signals GCLK[3:0], the local clock signal 
LOGICLK0, or a complement thereof as the signal INPUT_CLK in 
response to the signal C a . n . The multiplexers 176 and 178 may be 

18 
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configured to select one of the global clock signals GCLK[3:0] , the 
local clock signal LOGICLK1, or a complement thereof as the signal 
OUTCLK in response to the signal C a . n . 

The power down circuit 180 may power down the memory 13 0 
5 in response to the signal C a _ n . 

The memory 130 may be configured, in one example, as 
asynchronous, synchronous inputs, synchronous outputs, pipelined, 
xl, x2, x4, x8, look up tables, and/or ROM code. The output of the 
% 'i memory 130 may be registered during asynchronous operation. The 

s sa 

lfH memory 13 0 may be, in one example, (i) implemented as an 8K-bit 

CO 

^ J memory and (ii) configured to have a block size of 8Kxl, 4Kx2, 

o 

? 2Kx4, or 1Kx8. Two or more memories 130 may be cascaded to provide 

\% wider and/or deeper memory blocks. The memories 130 of different 

L y 

3 logic block arrays 114 may be cascaded. During synchronous 
15 operation, the inputs and output of the cluster memory 13 0 may be 
independently clocked. 

Referring to FIG. 8, a detailed block diagram of the 
configurable memory 116 is shown. The configurable memory 116 may 
be implemented similarly to a circuit 100 that may be found in the 
20 co-pending application Attorney Docket No. 0325.00293, filed 
concurrently, which is hereby incorporated in its entirety. The 

19 
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memory 116 may comprise a memory array 182, a register 184, a 
register 186, a read pointer 188, a register 190, a write pulse 
generator 192, a register 194, a register 196, a write pointer 198, 
a register 200, a write pulse generator 202, a register 204, a 
5 register 206, and a dedicated FIFO flags and logic block 208. The 
memory array 182 may be implemented, in one example, as an 
asynchronous dual port memory array. The memory array 182 may have 
a configuration input 2 09 that may receive the signal C a . n , a first 

C3 

J- port (e.g., A) and a second port (e.g., B) . The port A may have a 

im data input 210 that may receive (i) a data signal (e.g., DINH) from 

CO 

"~4 the horizontal routing channel 104 or (ii) a data signal (e.g., 

O 

DINV) from the vertical routing channel 106. The signal DINH or 

?* 

TL1 

ij the signal DINV may be presented directly to the input 210 or 

C3 

q through the register 184. The port A may have an address input 212 

=. 

15 that may receive (i) an address signal (e.g., ADDRH) from the 
horizontal routing channel 104 or (ii) an address signal from the 
read pointer 188. The signal ADDRH may be presented directly to 
the input 210 or through the register 190. The port A may have an 
enable input 214 that may receive (i) an enable signal (e.g., WEA- 

20 ENR_H) from the horizontal routing channel 104 or (ii) an enable 
signal (e.g., WEA-ENR__V) from the vertical routing channel 106. 

20 
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The signal WEA-ENR_H or the signal WEA-ENR__V may be presented 
directly to the input 214 or through the register 192 and the write 
pulse generator 194. The registers 184, 186, and 190 may have a 
clock input that may receive (i) one of the global clock signals 
GCLK[3:0], a logic clock signal (e.g., PCLKAH) from the horizontal 
routing channel 104, or a logic clock signal (e.g., PCLKAV) from 
the vertical routing channel 106. The polarity of the clock signal 
received may be selected. 

The port B may have a data input 216 that may receive (i) 
the data signal DINH or (ii) the data signal DINV. The signal DINH 
or the signal DINV may be presented directly to the input 216 or 
through the register 194. The port A may have an address input 218 
that may receive (i) an address signal (e.g., ADDRV) from the 
vertical routing channel 106 or (ii) an address signal from the 
write pointer 198. The signal ADDRV may be presented directly to 
the input 210 or through the register 196. The port A may have an 
enable input 220 that may receive (i) an enable signal (e.g., WEB- 
ENR_H) from the horizontal routing channel 104 or (ii) an enable 
signal (e.g., WEB-ENR__V) from the vertical routing channel 106. 
The signal WEB - ENR_H or the signal WEB - ENR_V may be presented 
directly to the input 220 or through the register 2 00 and the write 
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pulse generator 202. The registers 194, 196, and 200 may have a 
clock input that may receive (i) one of the global clock signals 
GCLK[3:0], a logic clock signal (e.g., PCLKBH) from the horizontal 
routing channel 104, or a logic clock signal (e.g., PCLKBV) from 
the vertical routing channel 106. The polarity of the clock signal 
received may be selected. 

The port A may have a data output 222 that may present a 
data signal (i) to the horizontal routing channel 104, directly or 
through the register 2 04 and/or (ii) to the vertical routing 
channel 106, directly or through the register 206. The port B may 
have a data output 224 that may present a data signal (i) to the 
horizontal routing channel 104, directly or through the register 
204 and/or (ii) to the vertical routing channel, directly or 
through the register 206. The register 204 may have a clock input 
that may receive (i) one of the global clock signals GCLK[3:0], 

(ii) the clock signal PCLKAH, or (iii) the clock signal PCLKAV. 
The polarity of the clock signal received may be selected. The 
register 2 06 may have a clock input that may receive (i) one of the 
global clock signals GCLK[3:0], (ii) the clock signal PCLKBH, or 

(iii) the clock signal PCLKBV. The polarity of the clock signal 
received may be selected. 
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The dedicated logic block 208 may be configured to 
present port arbitration signals and/or FIFO flag signals to the 
horizontal routing channel 104 and the vertical routing channel 
106. 

5 The memory 116 may change configurations in response to 

the signal C a _ n . The memory 116 may be configured as an 
asynchronous dual port RAM, a synchronous dual port RAM, a 
synchronous input RAM, a synchronous output RAM or a FIFO memory. 
% t However, other configurations of memory may be implemented to meet 
llQ] the design criteria of a particular application. The memory 116 
s 4 may also configure a width of data stored in response to the signal 
■ C a . n . The memory 116 may have, in one example, a data width of xl, 
JJl x2, x4 or x8 . However, other widths of data may be implemented to 

E. _5 

meet the design criteria of a particular application. The memory 
15 116 may be implemented, in one example, as a 4K-bit dual port 
array. However, other size arrays may be implemented accordingly 
to meet the design criteria of a particular application. In one 
example, the memory 116 may be configured as a 4Kxl, 2Kx2, 1Kx4, or 
512x8 array. However, other configurations may be implemented 
20 accordingly with an array of a different size. Two or more 
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memories 116 may be cascaded using the routing channels 104 and 106 
to form larger memory blocks. 

Referring to FIG. 9, a diagram of a memory 116' 
illustrating a FIFO configuration is shown. When used in the FIFO 
configuration, the width of the FIFO may be expanded using two or 
more memories 116. However, the depth of the FIFO will generally 
be limited to the size of a single memory 116. For example, a 4K- 
bit memory 116 will generally yield a FIFO with a maximum depth of 
4K bits. The FIFO implemented with one or more memories 116 may be 
configured to receive data signals, a write enable signal and a 
read enable signal from either the horizontal routing channel 104 
or the vertical routing channel 106. The FIFO may have independent 
read and write clocks. Clock signals for read, write, and 
synchronous activities may be chosen from the four global clocks 
GCLK[3:0] , the two logic clock signals PCLKA_H and PCLKB_H from the 
horizontal channel 104, and the two logic clock signals PCLKA_V, 
and PCLKB_V from the vertical channel 106. The polarity of the 
clock signals may also be selected. The FIFO configuration of the 
memory 116 may present data output and flag signals to both the 
horizontal routing channel 104 and the vertical routing channel 
106. 

24 
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Referring to FIG. 10, a diagram of a memory 116" 
illustrating a dual -port configuration is shown. In the dual port 
configuration, the memory 116 may have a port A and a port B. The 
port A and the port B may operate in a synchronous or asynchronous 
mode. However, both ports A and B will generally operate in the 
same mode. Port A generally receives data signals, address 
signals, and control signals from the horizontal channel 104 and 
presents data signals to the horizontal channel 104. Port B 
generally receives data signals, address signals, and control 
signals from the vertical channel 106 and generally presents data 
signals to the vertical channel 106. The ports A and B will 
generally be arbitrated so that the port A will generally win all 
conflicts. In the dual port configuration, two or more memories 
116 may be cascaded to provide a memory block with a larger width 
and/or depth. The clock signals for read, write, and synchronous 
activities may be chosen from the four global clocks GCLK[3:0] or 
from the four logic clocks PCLKA_H, PCLKB_H, PCLKA__V, and PCLKB_V 
from the channels 104 and 106. The polarity of the clocks may also 
be selected. 

Referring to FIG. 11, a block diagram of the PLL/CLK 
multiplexer circuit 110 is shown. The circuit 110 may comprise a 
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phase lock loop (PLL) 230, a multiplexer 232, a delay circuit 234, 
a number of dividers 236a-236n, a number of intermediate 
multiplexers 238a-238n, and a number of output multiplexers 240a- 
24 On. The signal INCLK(O) may be presented to a source clock input 
of the PLL 230. The PLL 230 may have (i) a number of outputs 242a- 
242n that may present a number of output signals (e.g., CLK_0, 
CLK_45, CLK_90, CLK_135, CLK_180, CLK_225, CLK_270, CLK_315, etc.) 
having different phases in response to the signal INCLK(0)and a 
feedback signal. The feedback signal may be (i) the signal 
INCLK(l) , (ii) an output signal (e.g., FB) or (iii) a signal 
generated by the clock delay circuit 234 in response to the signal 
FB. The PLL 230 may be configured to multiply the frequency of the 
signal INCLK(O) by a multiplier factor (e.g., 1, 2, 4, etc.) 
selected in response to the signal C a _ n . The PLL 23 0 may have an 
output that may present a status signal (e.g., PLL__LOCK) that may 
indicate when the PLL is locked. 

The signal CLK_0 may be presented to the dividers 236a- 
236n. The dividers 236a-236n may be configured to generate a 
signal (e.g., DIV_A-DIV_N) that may be presented to an input of the 
multiplexers 238a-238n, respectively, by dividing the signal CLK_0 
by a division factor (e.g., 1, 2, 3, 4, 5, 6, 8, 16, etc.). The 
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division factor may be selected in response to the signal C a . n . The 
signal INCLK(l) may be used as a board de-skew reference. When the 
signal INCLK(l) is used as a de-skew reference, one of the global 
clocks GCLK[3:0] may be presented from a macrocell 244 to the 
INCLK(l) input. The signals CLK_45, CLK_90, CLK_135, CLK_180, 
CLK_225, CLK_270, and CLK_315 may be presented to an input of the 
intermediate multiplexers 238a-238n. 

The multiplexers 238a and 240a may select, in one 
example, one of the signals DIV_A , CLK_45, CLK_90, CLK_135, 
CLK_180, CLK_225, CLK_270, CLKJ315, TCK, and INCLK(O) as the signal 
GCLK(O) in response to the signals C a _ n , INTEST, and GINTSCEN. The 
multiplexers 238b and 240b may select, in one example, one of the 
signals DIV_B , CLK_45, CLK_90, CLK_135, CLK__180, CLK_225, CLK_270, 
CLK_315, TCK, and INCLK(l) as the signal GCLK(l) in response to the 
signal C a . n , INTEST, and GINTSCEN. The multiplexers 238c and 240c 
may select, in one example, one of the signals DIV_C , CLK_4 5, 
CLK_90, CLK_13 5, CLK_180, CLK_225, CLK_2 70, CLK_315, TCK, and 
INCLK(2) as the signal GCLK(2) in response to the signal C a _ n , 
INTEST, and GINTSCEN. The multiplexers 238n and 240n may select, 
in one example, one of the signals DIV_N , CLK_4 5, CLK_90, CLK_135, 
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CLK_180, CLK_225, CLK_2 70, CLK_315, TCK, and INCLK(3) as the signal 
GCLK(3) in response to the signal C a _ n/ INTEST, and GINTSCEN. 

The circuit 110 may have a multiplexer 246 that may 
select a status signal (e.g., PLL_L0CK) or a normal I/O signal for 
presentation to a bonding pad 248 in response to the signal C a _ n . 
The macrocell 244 may be configured to select the signal GCLK(O) , 
the signal GCLK(l) , the signal GCLK(2) , or the signal GCLK(3) as an 
output clock signal (e.g., CLK-I/O) at a bonding pad 250. 

Referring to FIG. 12, a block diagram of the I/O block 
108 of FIG. 1 is shown. The I/O block 108 generally comprises an 
output PIM 252, an output control channel 254 and a number of I/O 
cells 256a-256n. In one example, the I/O block 108 may comprise 21 
I/O cells. The output PIM 252 may connect the I/O cells 256a-256n 
to the routing channels. The I/O cells 256a-256n may have (i) an 
input that may receive the signal C a _ n , (ii) an input that may 
receive the global I/O control signal GCNTRL [3 : 0] , (iii) an input 
that may receive the global clock signal GCLK[3:0], (iv) an input 
that may receive a control signal (e.g. OCC[11:0] from the output 
control channel 254, (v) an input that may receive a signal from 
the output PIM 252, (vi) an output that may present a signal to an 



28 



0325.00292 
CD99047 

input of the routing channel 104 or 106, and (vii) an input/output 

that may be connected to a bonding pad. 

Referring to FIG. 13, a block diagram of an I/O cell 256 

is shown. The I/O cell 256 may comprise a register 258, an OE 
5 register 260, a tristate output buffer 262, a programmable slew 

rate control 2 64, and a programmable bus hold 2 66. The register 

258 may be used as an input register or an output register. The OE 
£3 register 260 may provide compatibility for a synchronous circuit 

o 

architecture with minimal bus latency memory. The I/O cell 256 may 



lf| be configured for a number of I/O standards that may include PCI, 
O SSTL, HSTL, GTL+, LVTTL, LVCMOS, LVCMOS2 , LVCMOS18, HSTL (I, II, 

2 

K III, IV), SSTL2(I, II), SSTL3 (I, II). 

iy 
Id 

p The register 2 58 may have an enable input that may 

■- =3 

U receive (i) one of the global I/O control signals GCNTRL[3:0] , (ii) 
15 a signal from the output control channel 254 or (iii) a supply 
voltage (e.g., VCC) . The registers 258 and 260 may have a clock 
input that may receive one of the global clock signals GCLK[3:0]. 
The polarity of the clock signals may also be selected. The 
registers 258 and 260 may be reset by one of the global control 
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signals GCNTRL[3:0] or a signal from the output control channel 
254. 

The tristate output buffer 2 62 may be enabled 
asynchronously or synchronously by one of the global output control 
signals GCNTRL [ 3 : 0 ] or a signal from the output control channel 
254. The tristate output buffer 262 may have a slew rate control 
input that may receive a signal from the slew rate control circuit 
264. The slew rate control circuit 264 may be configured to 
control the slew rate of the output buffer 2 62 in response to a 
portion of the control signal C a _ n . The bus hold circuit 2 66 may be 
configured to present a signal to the output of the macrocell 256 
in response to a portion of the signal C a _ n . 

Referring to FIG. 14, a block diagram of the PLD 100 
illustrating the grouping of I/O blocks into I/O banks. The 
separate I/O blocks allow for improved design "fitting" , pinout 
flexibility, and I/O performance. The I/O blocks may be combined 
to form I/O banks that may be configured to support all of the 
current I/O standards within the 1.5V to 3.3V range. Each I/O bank 
may have an independent supply voltage input (e.g., VCCO) and a 
voltage reference input (e.g., VREF) . I/O cells within a bank may 
be configured for different standards that (i) have a common VCCO, 
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and (ii) have a common VREF. I/O standards that generally do not 
require a VREF may coexist with standards that do need a VREF. 
When a VREF is not required, the pin may be configured as an input 
or output . 

While the invention has been particularly shown and 
described with reference to the preferred embodiments thereof, it 
will be understood by those skilled in the art that various changes 
in form and details may be made without departing from the spirit 
and scope of the invention. For example, the size and number of 
logic blocks, logic block arrays, single port RAM, dual port RAM, 
FIFO RAM, I/O blocks, PLLs, and routing channels could be smaller 
or larger. The architecture could be extended to more levels of 
hierarchy, for example, there could be groups of logic block arrays 
that could be connected with another level of vertical and 
horizontal routing. The architecture may be constructed without 
some of the base components like the PLL 110, the regulator, the 
memory 116, or the memory 13 0. 



